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(54) Disk storage system having redundant solid state data storage devices 



(57) A RAID device is provided with a pair of non- 
volatile solid state data storage devices and one or more 
rotating disk drives, giving improved access time per- 



formance to the array. Data is staged on the pair of solid 
state data storage devices, and periodically backed up 
to the rotating disk drive(s). 
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Description 

Field of the Invention 



[0001] The present invention relates to data storaqe 
dev,ces for storage of electronic data, and particularly 
though not exclusively to a data storage device compris- 
ing at least one rotating disk drive, and one or more solid 
state data storage devices capable of substantially fast- 
er read and write operations than the disk drive. 

Background to the lnv* n r, nn 

[0002] Conventional data storage systems for high 
capacity data storage include conventional 'hard drive' 
disk dnve devices as are well-known in the art. Conven- 
tional disk drive comprise a rotating disk having a mag- 
netically readable and writeable coating, which revolves 
at high revolutions per minute. Such disk drives may 
store data capacities in the range 4 to 32 Gbytes or more 
for each individual drive. For greater storage capacities 
devices are arranged in arrays, for example in redun- 
dant arrays of inexpensive disks (RAID arrays) as are 
known in the art. Conventional hard disk drives have rel- 
atively slow access times for accessing data, compared 
to conventional solid state devices such as dynamic ran- 
dom access memories (DRAM). The relatively high ac- 
cess time for accessing data in a hard disk drive is pre- 
dominantly a result of moving a read head to read a par- 
ticular part of the rotating disk. 

[0003] Hard disk drives are extremely common due to 
their relativefy low price per unit data storage capacity. 
However, there are also known in the prior art solid state 
data storage devices having relatively lower densities of 
data storage, and having much faster access times than 
hard disk drives but with higher costs than hard disk 
drives. Such devices are currently niche market devic- 
es, due to their cost. These solid state memory devices 
have data storage capacities between those of hard disk 
dnves and conventional D RAM's. 
[0004] The applicants have developed a data storage 
system having a combination of rotating hard disk drive 
and solid state data storage devices in a form which is 
capable of low cost production and which may be suit- 
able for large scale mass market applications. Such sys- 
tems may provide a high performance data storage de- 
vice solution for replacement of conventional hard disk 
dnves in mass market applications. 
[0005] The applicants have studied how usera may 
wish to actually utilize the high performance solid state 
data storage devices in customer applications, and as 
a result of this study have identified various problems 
with the usage of these high perfoimance data storaqe 
devices. The applicants estimate that the optimum us- 
age of such high performance data storage devices 
would be in conjunction with one or more conventional 
hard disk dnves. For optimum technical utility, the appli- 
cants envisage that a plurality of high performance solid 
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state non volatile data storage devices will be used in 
conjunction with one or^a plurality of conventional hard 
disk drive devices, whereby more frequently accessed 
data ,s stored on the solid state data storage device 

rlrnTJ! 58 frequent, V access *d data is stored on a 
hard disk drive. 

[0006] A least cost solution is to provide a single high 
performance solid state data storage device in series 
with one or more rotating disk drives, where data is read 
and written through the solid state device, such that 
most frequently used data is cached within the solid 
state data storage device. 

[0007] Conventional RAID devices use a plurality of 
redundantly arranged disks such that data is encoded 
over a number of disks, so that an individual disk can 
fail, whilst the data is still recoverable from redundancy 
encoded data stored on the other disks. Introducing a 
single high performance solid state data storage device 
introduces a single point of failure into a data storage 
system. Any data which is solely stored on the solid state 
data storage device will be at risk of loss if the device 
should fail. 

[0008] A possible alternative solution to the problem 
of single point of failure using an arrangement of a high 
performance solid state data storage device and one or 
a plurality of hard disk drives, is to use two high perform- 
ance solid state data storage devjces wh . ch m . rror 

other in the data which each device stores. The pres- 
ence of duplicate copies of data removes the single 
point of failure. However, a disadvantage of this solution 
is that the cost of this system increases substantially due 
to the use of two high performance solid state data stor- 
age devices, but only the benefit of the storage capabil- 
ity of a single high performance solid state data storaqe 
device is achieved, the second high performance data 
storage device being wholly redundant. 

Summa ry of the Invention 

[0009] The present invention relies on the use of two 
high performance solid state data storage devices and 
at least one rotating disc data storage device operating 
under control of a controller device, and makes efficient 
utility of the available user data storage space in both 
high performance solid state data storage devices The 
result is the provision of a bulk data storage system hav- 
ing a fast write time, but with removal of any single point 
of failure in a data storage system, with the benefit of an 
enhanced effective reliability. More than half the data 
storage space on the combined plurality of high perform- 
ance data storage devices may be available for use for 
storing user data. 

[0010] One effect of the specific embodiments of the 
present invention may be to speed up access to user 
data stored in a data storage device system or subsys- 
tem comprising a plurality of disk drives. 
[001 1 ] In a conventional prior art volatile cache mem- 
ory, an objective is to provide rapid write times and rapid 
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read times, so that data which would otherwise be 
"stored o'n'a rotatihg v hard:disk, thereby incurring long ac- 
cess times, is instead stored in a fast access memory, 
reducing the access time for regularly accessed infor- 
mation. In contrast in the specific embodiments of the 
present invention, an objective is to provide a rapid write 
access to large volumes of memory, so that a write con- 
firmation signal can be rapidly returned to a host device, 
so that a processing in the host device is not delayed in 
processing data, by waiting for a write acknowledge- 
ment signal, thereby allowing the processor to carry out 
other data processing tasks sooner. A fast write process 
to large amounts of memory may be available to a host, 
using the presently disclosed data storage system. 
[001 2] Two or more high performance non volatile sol- 
id state data storage devices are arranged with at least 
one disk drive partition in a group. Each high perform- 
ance data storage device is logically divided into two 
partitions, the larger partition of each high performance 
data storage device being allocated to user data, and a 
smaller partition of each of high performance data stor- 
age device being allocated as a data staging area. The 
first data staging area in the first high performance data 
storage device is logically matched to the second user 
data area of the second high performance data storage 
device and vice versa, the second data staging area of 
the second high performance data storage device is log- 
ically matched to the first user data area of the first high 
performance data storage device. Hence, the disk drive 
partitions, which are part of the group, are used to cap- 
ture data from the data staging areas. Movement of data 
between the high performance solid state devices and 
the hard disk drive(s) is achieved under control of a con- 
troller device. 

[0013] In this specification, the term non volatile solid 
state data storage device is used to mean a data storage 
device having no moving parts, and which retains mem- 
ory of data for an extended period of days or months 
when external power is removed from the device. Such 
devices may include magnetic random access memory 
(MRAM) devices, dynamic random access memory 
, (DRAM) devices having a battery power supply, ferro- 
electric random access memory (FERAM) devices, and 
flash electrically programmable read only memory de- 
vices. Such devices may be implemented as discrete 
units, for example of a size and shape suitable for direct 
replacement of conventional rotating disk hard drives, 
or may be implemented as part of a controller device of 
a RAID, on a same chip or component board as such a 
controller device. 

[0014] In the data storage system of the present dis- 
closure, there can be rapidly acknowledged a quick ac- 
cess write of data to a non-volatile memory device, 
which represents an improvement on the write times 
and acknowledgement times for prior art non-volatile ro- 
tating disk data storage devices. 

[0015] In one mode of operation a write operation to 
the group routes the user data to be stored to the user 



data area of the first high performance data storage de- 
* vice. Additionally, a copy of the data is written to; the sec-v 
ond data staging area of the second high performance 
data storage device. Once this is complete, a host com- 
5 puter considers the transaction complete and is free to 
carry on with other functions. This results in a substan- 
tially shorter write time than a write time to a conven- 
tional hard disk drive. 

[0016] As a separate process, data stored in the sec- 
10 ond data staging area of the second high performance 
data storage device is copied to the disk drive partition 
of a conventional disk drive data storage device and de- 
leted from the second data staging area of the high per- 
formance data storage device. This frees up staging 
15 space for subsequent write operations to the second da- 
ta staging area of the second high performance data 
storage device and may allow the second size of the 
second data staging area to be substantially smaller 
than the user data space on the second high perform- 
20 ance data storage device. 

[0017] Read operations from the group are directed 
to a said user data area on the first or second high per- 
formance data storage device. 

[0018] Should a failure of a high performance data 
25 storage device occur, it is necessary to recover the orig- 
inal data onto a new high performance data storage de- 
vice. Once this is installed, all the user data may be cop- ' 
ied back into place from an archive which exists on the 
disk drive partition. 
30 [0019] According to a first aspect of the present inven- 
tion there is provided a data storage system comprising: 

at least one rotating disk data storage device; 

35 a plurality of solid state data storage devices, each 
having a data capacity less than a data capacity of 
said rotating disk data storage device; t • *- 

wherein each solid state data storage device is di- 
40 vided into a data staging area and a user data area; 
and 

said plurality of data storage devices are arranged 
such that a set of data is written to at least two of 
45 said plurality of solid state data storage devices; 
and 

data is moved from a said data staging area of a 
said solid state data storage device into said rotat- 
50 jng disk data storage device. 

[0020] Preferably a said solid state data storage de- 
vice comprises a controller device comprising a proces- 
sor and a computer program for controlling said proces- 
55 sor, said controller device operating to: 

receive data from a host device; 
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write said data to said plurality of solid state data 
•■- storage devices,- such that a copy of said data is— * 
stored on each said solid state data storage device *" " 
and ' 

transfer data from a said data staging area of at 
least one said solid state data storage device to a 
data partition area of said at least one rotating disk 
data storage device. 



[0021] A said solid state data storage device may 
rn° n T' Se 9 ma 9 netic rand °™ access memory device 
[0022J Said solid state data storage device may com- 
pnse at least one intrinsically volatile dynamic random 
access memory, and at least one battery device, provid- 
ing^ combined non-volatile solid state data storage de- 

[0023] Said solid state data storage device may com- 
pnsea non-volatile flash electrically programmable read 
only memory. 

[0024] Said solid state data storage device may com- 
mon? n° G J eCM ° rand ° m access mem °y <*«"«>. 
[0025] Preferably said solid state data storage devic- 
es have a memory capacity of greater than 1 Gbyte 
[0026] Preferably said solid state data storage devic- 
es are non-volatile devices. 

2 3 SaW data Sta 9 ,n 9 area comprises 

less than 50% of the data storage capacity of a corre- 
sponding said solid state data storage device 
[0028] Preferably the data storage system comprises 
a controller device comprising a processor and a com- 
puter program for controlling said processor, and a plu- 
rality of data buses connecting said controller with said 
plurality of solid state data storage devices; wherein 
each sa , d solid state data storage device communicates 
with said controller device by a corresponding respec- 
tive separate said bus. 

[0029] According to a second aspect of the present 
invention -there is provided a method of storing data in 
a data storage system comprising a plurality of rotating 
disk drive data storage devices and a plurality of solid 
state data storage devices, said method comprising the 

Steps Of.' 
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writing a second block of data to a first data staging 
?area-ofsaid first solid state; data storage, device; 

writing said same second block of data to a second 
user data storage area of said second solid state 
data storage device; and 

writing said second block of data from said first data 
staging area of said first solid state data storage de- 
vice to a second partition memory area of said plu- 
rality of rotating disk drives. 

[0031] Said step of writing data to said second data 
staging area and said first user data area may comprise: 

(i) receiving said first block of data in a plurality of 
data segments : each data segment being smaller 
than said first block of data; 

(ii) writing a said segment to said second data stag- 
ing area and writing said data segment to said first 
user data area substantially simultaneously; 

(iii) after writing said segment to said second data 
staging area and said first user data area, selecting 
a new segment; 

(iv) repeating steps (ii) to (iii) until all said segments 
of said data first block are written; and 

(v) generating a confirmation signal for confirming 
that said first data block has been written. 



35 



40 



Preferably said method comprises the steps 



writing a first block of data to a first user data area 
in a first said solid state data storage device; 

writing said first block of data to a second data stag- 
mg area in a second said solid state data storage 
device; a 

writing said data stored in said second data staging 
area of said second solid state data storage device 
to a f.rst data partition area in said plurality of rotat- 
ing disk drives. 

[0030] The method may further comprise the steps of: 
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[0032] 

of: 

if new data has been written to said second data 
staging area, copying data of said second data staging 
area to said first data partition area of said plurality of 
rotating disks. H y 

[0033] Preferably said method comprises the steps 

determining whether said second data staging area 
is available for reading; 

determining whether unimpeded access to said first 
data partition area of said plurality of rotating disks 
is available; and 

moving data of said second data staging area to 
said first data partition area of said plurality of rotat- 
ing disks. 

[0034] Said step of writing data to said first data stag- 
ing area and said second user data area may comprise: 

(vi) receiving said second block of data in a plurality 
of datasegments, each data segment being smaller 
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than said second block of data; 

: S.V f . - J. .. i r. ' Cijffa: *- r . ■ . " •■ TV** Win'- - ' . ■. -v. :»c-:,: ■» ■ , . •" 

(vii) writing a said segment to said first data staging 
area and writing said segment to said second user 
data area substantially simultaneously; 

(viii) after writing said segment to said first data 
staging area and said second user data area, se- 
lecting a new segment; 

(ix) repeating steps (vii) to (viii) until all said seg- 
ments of said second data block are written; and 

(x) generating a confirmation signal for confirming 
that said second data block has been written. 

[0035] Preferably said method comprises the steps 

of: 

if new data has been written to said first data stag- 
ing area, moving said data of said first data staging area 
to a second data partition area of said plurality of disks. 
[0036] Preferably said method comprises the steps 
of: 

if new data has been written to said first data stag- 
ing area, moving data of said first data staging area to 
said second data partition area of said plurality of rotat- 
■ ing disks. 

[0037] Preferably said method comprises the steps 
of: 

determining whether new data has been written into 
a first data staging area; 

determining whether said first data staging area is 
available for reading; 

determining whether unimpeded access to a sec- 
ond data partition area of said plurality of rotating 
disks is available; and 

moving data of said first data staging area to said 
second data partition area of said plurality of rotat- 
ing disks. 

Brief Description of the Drawings 

[0038] For a better understanding of the invention and 
to show how the same may be carried into effect, there 
will now be described by way of example only, specific 
embodiments , methods and processes according to the 
present invention with reference to the accompanying 
drawings in which: 

Fig. 1 illustrates schematically a logical layout of a 
data storage system, cooperating with a host device 
for storage of data; 

Fig. 2 illustrates schematically an internal hardware 



architecture of a data storage system according to 
a best mode specific implementation of the present " 
invention; 

5 Fig. 3 illustrates schematically a first data storage 

operation and a partitioning of a memory area of a 
plurality of high performance solid state data stor- 
age devices, and a plurality of rotating disk drives 
in a best mode implementation according to the 

10 present invention; 

Fig. 4 illustrates schematically steps carried out by 
a control device comprising the data storage sys- 
tem, for carrying out a first write process for writing 
15 data received from a host device; 

Fig. 5 illustrates schematically a first back-up proc- 
ess controlled by a control device of the data stor- 
. age system, for backing up data stored on a high 
20 performance solid state data storage device to one 
or more rotating disk drives; 

Fig. 6 illustrates schematically a second data stor- 
age operation of said data storage system in which 

25 a host computer writes data to a first data staging 
area of a first solid state data storage device and a 
second user data area of a second solid state stor- 
age device, and data in said first data staging area 
is backed up to a first partition area of a plurality of 

30 rotating disk drives; u y 

Fig. 7 illustrates schematically a second write proc- 
ess carried out by a controller device of said* data 
storage system for writing data blocks to a first data 
35 staging area of a first solid state data storage device 
and to a second user data area of a second solid 
state data storage device; and . 4 

Fig. 8 illustrates schematically a second back up 
40 process for backing up of data in a first data staging 
area of a first solid state data storage device to a 
first data partition area of a plurality of rotating disk 
drives. 

« Detailed Description of the Best Mode for Carrying 
Out the Invention 

[0039] There will now be described by way of example 
the best mode contemplated by the inventors for carry- 
so ing out the invention. In the following description numer- 
ous specific details are set forth in order to provide a 
thorough understanding of the present invention, ft will 
be apparent however, to one skilled in the art, that the 
present invention may be practiced without limitation to 
55 these specific details. In other instances, well known 
methods and structures have not been described in de- 
tail so as not to unnecessarily obscure the present in- 
vention. 
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[0040] Referring to Fig. 1 herein, there is illustrated 
schematically a logical'diagram illustrating; a data .storT 
age system according to a first specific implementation 
of the present invention. A controller means 103 re- 
ceives data from a host computer 1 00 and can read and 
write to a first memory area 1 01 of relatively slow access 
time memory, provided by one or a plurality of rotating 
hard disk drives. The controller means 103 can also 
read and write data from one or a plurality of faster ac- 
cess memory areas provided by non-volatile solid state 
data storage devices 1 02, having a relatively faster ac- 
cess time than the hard disk data storage devices. Data 
is transferred between the hard disk data storage area 
1 01 and the solid state data storage area 1 02 under con- 
trol of the controller means 1 03, which is typically imple- 
mented as a computer program and processor. A hard 
disk partition memory area 1 04 is used to regularly back 
up data from the solid state data storage device memory 
area to provide recovery of data in the event of a failure 
of either solid state data storage device. 
[0041] Selected data is moved from the rotating disk 
data storage devices onto the solid state data storage 
devices. The data selected is suitably the most frequent- 
ly accessed data. Data which is accessed less frequent- 
ly, may be stored on the rotating hard disk data storage 
devices, where access times are slower. Therefore, for 
the most frequently accessed data read/write times are 
improved compared to the conventional case using ro- 
tating disk data storage devices. However to achieve a 
cost effective solution, conventional rotating hard disk 
drives are retained in the data storage system for stor- 
age of bulk user data which is relatively less frequently 
accessed. For a host computer using the data storage 
system, there is an overall improvement in data storage 
access times for much of the data which is read and writ- 
ten to the data storage system, compared to a prior art 
data storage system having solely rotating hard disk 
drives. 

[0042] Referring to Fig. 2 herein, there is illustrated a 
physical architecture of a data storage system 200, 
comprising an array of one or a plurality of rotating disk 
data storage devices 201 -204; two or more non-volatile 
solid state high capacity memory data storage devices 
205, 206; and a controller device 207 for managing stor- 
age of data amongst the one or plurality of rotating disk 
data storage devices and the plurality of solid state data 
storage devices. A host device 208 reads and writes da- 
ta from and to the data storage system 200, via a known 
small computer system interface (SCSI) cable, or a 
known Fiber Channel Association cable and interface 
210. 

[0043] The data storage system of Fig. 2 may com- 
prise a legacy redundant array of inexpensive disks 
(RAID) device, with the addition of the plurality of non- 
volatile solid state data storage devices 205, 206 and 
associated controller 207 for controlling transfer of data 
between the rotating disk drives and the solid state 
memory devices. Each solid state data storage device 



may externally look very similar to a rotating hard disk 
: data storage; device;,, being of a similar size, shape ;and-.& 
weight. However, the solid state data storage device 
may have read and write times of the order of 1 000 times 
5 quicker than the conventional rotating disk hard drive 
unit. The solid state data storage devices may typically 
have a data storage capacity of one to two Gbytes. 
[0044] In a best mode implementation, the solid state 
data storage devices comprise Magnetic Random Ac- 
10 cess Memory (MRAM) data storage devices. These de- 
vices are non-volatile, and have relatively fast read and 
write access times compared to conventional rotating 
disk hard drive units. The M RAM devices are intrinsical- 
ly non-volatile, that is to say when electrical power sup- 
's ply is removed from the devices, the data is retained in 
the memory and is not lost. 

[0045] However, in an alternative implementation, the 
solid state of data storage devices may comprise flash 
EPROMS, ferro electric random access memories 
20 (FERAM), or Dynamic Random Access Memory 
(DRAM) arrays. In the case of a DRAM unit, DRAM de- 
vices are intrinsically volatile, and lose their stored data 
when power is removed. In order to make a non-volatile 
solid state data storage device, a combination of an ar- 
2 5 ray of DRAM devices, and a battery power supply in a 
casing is provided. Although DRAM's provide better per- 
formance in terms of read and write access times than 
a comparable MRAM unit, there is the disadvantage of 
the need to provide a battery back-up to overcome the 
30 intrinsic volatility of D RAM devices to provide a non-vol- 
atile DRAM data storage unit. 

[0046] On the other hand, flash EPROM's are intrin- 
sically non-volatile, but have poorer read and write ac- 
cess times than a comparable MRAM device. 
35 [0047] FERAM devices are intrinsically non volatile, 

and have access times comparable with those of MRAM 

devices, but have much lower storage capacity densi- 
ties. 

[0048] MRAM data storage units, DRAM datastorage 
^0 units and EPROM datastorage units are alternative em- 
bodiments, but in the best mode the M RAM data storage 
unit is selected to give the optimum combination of in- 
trinsic non-volatility, and fast read and write access 
times. 

45 [0049] Controller device 207 comprises a local proc- 
essor 209 having the task of managing data transfers 
between the first and second solid state data storage 
devices and the one or more hard disk drives, and man- 
aging storage of data as between the plurality of hard 
so disk drives. The local processor operates in accordance 
with control software 211 , operating a plurality of man- 
agement algorithms for effecting the data transfers be- 
tween individual data storage devices in the data stor- 
age system 200. 
55 [0050] Where a plurality of rotating hard disk drives 
are used, user data may be striped across the plurality 
of rotating hard disk drives, such that blocks of user data 
are redundancy encoded using a known error correction 
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coding, for example a Reed Solomon coding, and each 
data'btocJt.feiStared across the plurality, of rotating disk; 
drives. In the event of a failure of any one disk drive, the 
data stored on that disk drive may be recoverable from 
the redundancy coding in the data stored on the other 
rotating disk drives. If any one disk fails, there is enough 
data on the remaining disks to reconstruct the lost data 
on the failed disk. A new replacement disk drive can be 
inserted to replace the failed disk drive and the lost data 
on the new disk drive can be reconstituted automatically 
from the redundancy coded data on the other disk 
drives. Thus, within the plurality of rotating disk drives 
there is no single point of failure. It is important that in- 
troduction of a solid state data storage device does not 
introduce a single point of failure. 
[0051] The provision of two or more solid state data 
storage devices overcomes the risk of a single point of 
failure, in the event that one of the solid state data stor- 
age devices should fail. Thus, a single point of failure in 
terms of hardware is avoided. The plurality of hard disk 
drives are augmented by the plurality of solid state data 
storage devices in a way such that the introduction of 
the solid state data storage devices do not introduce a 
single point of failure into the data storage system 200. 
[0052] However, to avoid loss of data in the event of 
a failure of a single solid state data storage device, there 
must be ensured that the data on each solid state data 
storage device is recoverable from elsewhere in the ar- 
ray of data storage devices. 

[0053] A simple mirror replication of the data stored 
on first solid state data storage device 205 onto second 
solid state data storage device 206 would achieve reli- 
ability. However, this is an inefficient solution, since 
there is incurred the cost of two solid state data storage 
devices, but the memory capacity of only one solid state 
data storage device is utilized. 

[0054] In the best mode implementation presented 
herein, it is aimed to avoid a single point of failure by 
introduction of two or more solid state data storage de- 
vices, whilst optimizing the relatively high performance 
data storage capacity provided by those devices for 
storage of user data. In a first configuration, the data 
storage system operates such that as files are retrieved 
from the one or more rotating disk data storage devices, 
they are copied to the solid state data storage devices 
205, 206 which act as a cache for those files. Various 
logical rules may be embedded in the controller device 
207 for determining when a file will be copied to the solid 
state data storage devices, to achieve optimum per- 
formance. The controller device may include one or a 
plurality of algorithms for determining which data files 
are the most used, and for determining whether these 
should be stored in the solid state memory data storage 
device, or overwritten in the solid state data storage de- 
vice. The solid state data storage devices act as a cache 
for the one or more rotating disk drives. 
[0055] The controller device 207 maintains a table da- 
ta containing entries of where each block of data has 



been stored. As data is moved from the solid state data 
V*f storage device-memory to. the rotating disk drive area, 
and back again from the rotating disk drive areas to the 
solid state data storage device areas, the table data is 
5 updated. In the event of a failure of either solid state 
data storage device, the controller can determine where 
a block of lost data is also copied either on the other 
solid state data storage device or on the plurality of ro- 
tating disk drives. 
w [0056] In a second configuration, the solid state data 
storage devices are configured as permanent user data 
storage areas to augment the storage areas of the one 
or more rotating hard disk drives, for permanent storage 
of bulk user data. 
15 [0057] Referring to Fig. 3 herein, there is illustrated 
schematically partitioning of data areas within the first 
and second solid state data storage devices 205, 206 
and movement of data between memory areas on the 
solid state data storage devices and the hard disk drive 
20 data storage devices. With two solid state data storage 
devices, full redundancy may be achieved, whilst utiliz- 
ing more than the data storage capacity of one solid 
state data storage device and achieving as far as pos- 
sible the combined data storage capacity of both solid 
25 state data storage devices. 

[0058] The first solid state data storage device 2 05 is 
partitioned logically such that there is a first main user 
data area 300, which forms the bulk of the memory ca- 
pacity of the device, and a first data staging area 301 . 
30 Similarly, the second solid state data storage deyice is 
partitioned into a second user data area 302, occupying 
the bulk of the memory area of that solid state data stor- 
age device, and a second data staging area 303.* In the 
best mode, each data staging area may constitute less 
35 than 50% of the total memory capacity of the .corre- 
sponding respective solid state data storage device. 
[0059] Whilst the best mode implementatiohYherein 
shows a separate processor 209 comprising control de- 
vice 207, in principle, the functions of the control device 
40 can be provided by software in the host device 208 itself. 
However, this has the disadvantage of taking up 
processing power from a processor in the host device 
208. The best mode implementation described herein is 
aimed at a mid-range data storage solution. The inven- 
ts tion is not limited to use of a separate processor at the 
control device 207, and the functionality of the control 
device 207 may be carried out using a processor in host 
device 208. 

[0060] Fig. 3 illustrates a first storage operation of a 
so host computer writing to the data storage system. The 
host computer may see the controller 207 of the data 
storage system as one logical drive unit, for example 
drive D. The controller handles data storage to all the 
data storage devices in the data storage system. The 
55 host computer, when it writes data to the data storage 
system, writes data to the controller 207, which then 
writes data to the first user data area 300 of the first solid 
state data storage device, and writes a copy of that 
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same data into the second data staging area 303 of the 
^sepondjSoHd^ wr it e op--t> " 

erations occur simultaneously. Bearing in mind that the 
second data staging area 303 has a significantly lower 
data capacity than the first user data area 300, if data s 
is continued to be written to the data storage system, 
the second data staging area 303 will be the first high 
performance data storage area to become full. There- 
fore, thecontroller device 207 moves the data in the sec- 
ond data staging area 303 into a first partitioned memory 10 
area 304 stored on the plurality of hard disk drives. The 
memory management strategy is to keep the second da- 
ta staging area 303 as empty as possible, by regularly 
transferring the data stored in that staging area onto the 
first partition area 304 in the plurality of disk drives 15 
201-204, as soon as possible after data is received in 
the second data staging area. 

[0061] In the best mode herein : the second data stag- 
ing area 303 is emptied on a first in-first out basis to the 
first hard disk data partition. However, in other imple- 20 
mentations, other algorithms for selecting the data to be 
emptied are possible. It is important to ensure that the 
second data staging area 303 does not overflow, result- 
ing in overwrite of data, before this data has been 
backed up onto the first data partition area 304 on the 25 
hard disk drives. If the second data staging area 303 is 
allowed to become full, then any write operations from 
the hostcomputerwill be held off, and the host computer 
could stall. 

[0062] Referring to Fig. 4 herein, there is illustrated 30 
schematically a first write process carried out by con- 
troller device 207 upon receiving a data block from host 
device 208, for storage in the data storage system; At 
all times, data written into the first solid state data stor- 
age device 205 must be replicated in the second solid 35 
state data storage device 206 : in case either of the solid 
state data storage devices should fail. In step 400, a 
block of data is received from the host device 208 by the 
controller device 207. In step 401 the block of data is 
written to the first user data area of the first solid state 40 
memory device. In step 402, the same data block is writ- 
ten to the second data staging- area of the second solid 
state memory device. In practice, steps 401, 402 may 
be carried out in interleaved manner, such that part of 
the data block is written to the first solid state storage 45 
device 205, followed by the same segment of data being 
written to the second solid state storage device 206, fol- 
lowed by a further segment written to the first said state 
data storage device, the further segment being written 
to the second solid state data storage device and so on so 
until the whole of the data block has been written to both 
solid state data storage devices segment by segment. 
Using a single processor 209, this operation therefore 
takes twice as long (i.e. about 20 u.s) as writing the data 
block to a single solid state data storage device. How- 55 
ever, since the write times to the solid state data storage 
device are of the order of 10 u.s, this is of the order of 
one thousand times faster than the write time to the con- 



ventional rotating disk drive, which is of the order of 10 
y ms'.' Therefore, doubling the- write..time:tb ithe.solid state 
data storage device is insignificant compared to the 
write time to a rotating disk drive device. 
[0063] However, the write speed can be reduced to 
the write time of a single solid state data storage device, 
if two processors are used in parallel to write the seg- 
ments of data to the first and second solid state storage 
devices simultaneously in parallel. In the best mode im- 
plementation, a single processor is used for reasons of 
cost reduction, and the small disadvantage in increased 
write time to the solid state data storage devices is ac- 
cepted. 

[0064] In step 404, once the control device 207 has 
written the complete data block to both solid state data 
storage devices, it sends a confirmation signal back to 
the host device 208, which the host device may use as 
a trigger to send a further data block for storage. 
[0065] Use of a high performance non-volatile solid 
state data storage device for receiving written data from 
a host computer allows a quick response time for send- 
ing the confirmation signal back to the host's computer, 
acknowledging that the data has been correctly written 
to memory. 

[0066] In contrast, prior art conventional volatile 
cache plus hard disk data storage devices operate on a 
write-through basis : in which data written to the cache 
is re-written on to the rotating hard disk drive before an 
acknowledgement signal is sent back to a host compu- 
ter writing the data. Inthisway, in the conventional write- 
through cache data storage device systems, it is en- 
sured that the data is written to non-volatile memory on 
the rotating disk, before an acknowledgement signal is 
sent back to the host device. Thus, if there is a power 
failure during a write-through operation: no acknowl- 
edgement signal will be received, andth'e data stored 
in volatile memory may be lost. The host device will then 
re-send the data, because the acknowledgement signal 
has not been received. However the conventional write- 
through cache data storage systems, incur the penalty 
of slow access time to the rotating hard disk drives. 
[0067] In contrast, in the presently disclosed system, 
an acknowledgement signal is received by a host device 
faster, because data is written to a quick access time 
high performance high capacity non-volatile solid state 
data storage device. 

[0068] This write time of approximately 20 us in the 
best mode herein compares with a prior art write time, 
taking example of a prior art 5-disk raid array, of the or- 
der of 40 ms. The prior art raid array has a relatively high 
write time because the data is striped across the 5-disks , 
and in order to check that the data has been written cor- 
rectly, the data must also be re-read from each of the 5 
disks by a control device. Therefore a significant write 
speed advantage is achieved by the best mode imple- 
mentation over prior art arrays. 

[0069] Referring to Fig, 5 herein, there is illustrated 
schematically a first back-up process of the data storage 
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system. In the first back up process, data in the second 
" data-staging area 303of the second solid state data stor- 
age device is transferred to the first partition area 304 
on the plurality of disks 201-204, as soon as the solid 
state data storage device is available to read from. The 
data in the second data staging area 303 is re- written to 
the first data partition area as soon as possible, in order 
to keep the second staging area 503 available for re- 
ceiving as large a quantity of write data as possible. This 
is in contrast to a conventional prior art volatile cache, 
where the cache is emptied as late as possible, in order 
to cache as large a quantity as data at once as possible. 
[0070] Second staging area 303 is used to transiently 
store data on its way to the first rotating disk partition 
area 304. The same data is also stored in the first user 
data area 300, so that if the second solid state data stor- 
age device fails at any time, data recovery is available 
from the first solid state data storage device. 
[0071] In step 500, the processor 209 under control 
of control software 211 checks whether any data has 
been recently written to the second data staging area 
303. if new data has been recently written to the second 
staging area, then in step 501 the controller checks 
whether the second staging area is available for read- 
ing, that is to say checks that the second solid state 
memory device is not busy, and checks in step 502 
whether there is unimpeded access to the rotating hard 
disk(s). If both the hard disk drives and the second stag- 
ing area are available, then in step 503 data is trans- 
ferred from the second data staging area to the first hard 
disk partition area. 

[0072] If the physical rotating hard disk is already be- 
ing accessed for a write operation, then a transfer from 
the solid state device to the rotating hard disk cannot be 
made. Both the rotating hard disk must be available for 
writing, and the solid state data storage devices must 
be available for reading in order to complete step 503 
of copying the data in the second data staging area to 
the first hard disk partition area. • 
[0073] Referring to Fig. 6 herein, there is illustrated 
schematically a second storage operation of the data 
• storage system carried out contemporaneously with the 
processes described with reference to Figs. 3-5 herein, 
for writing a second block of data to the data storage 
system. 

[0074] An equivalent process to that described with 
reference to Figs. 3 to 5 occurs during a write operation 
of a block of data from the host device, whereby the sec- 
ond block of data is written to the second user data area 
302 of the second solid state data storage device, and 
to the first data staging area 301 of the first solid state 
data storage device. The data written into the first data 
staging area is periodically backed up to a second par- 
tition memory area 600 on the plurality of disk drives 
201-204. 

[0075] Therefore, the first data staging area 301 has 
written to it the same data as the second user data area 
302, the second data staging area 303 has the same 



data written to it as the first user data area 300, the sec- 
' ond data staging area 303 is backed up onto the first 
hard disk partition area 304; and the first data staging 
area 304 is backed up onto the second hard disk parti- 
5 tion area 600. 

[0076] Referring to Fig. 7 herein, there is illustrated 
schematically a second write process carried out by 
controller device 207. In step 700, the second data block 
is received by the controller device from the host device. 

10 in step 701 , the received block of data is written to the 
second user data area of the second solid state memory 
device. Contemporaneously with step 701 , the same 
block of data is written to the first data staging area 301 
of the first solid state memory device in step 702. Simi- 

15 S larly, as with steps 401 and 402 described herein above, 
writing of a block of data to the second user data area 
and the first data staging area is made by partitioning 
the second block of data into segments of data which 
are written alternatively to the second user data area 

20 and first data staging area until the whole of the data 
block has been written. In step 703, the block of data 
having been written to the second user data area and 
the first data staging area, a confirmation signal is re- 
turned to the host device, upon receipt of which the host 

25 device may send a further block of data. 

[0077] Referring to Fig. 8 herein, there^is illustrated 
schematically a background back up operation for back- 
ing up the content of the first data staging area to the 
second hard disk partition area. The process of Fig. 8 is 

30 similar and runs in parallel to the process described 
herein before with reference to Fig. 5. In step 800, the 
processor 207 determines whether new data has been 
written to the first staging area. 

[0078] In step 801 , it is checked whether the first stag- 

35 ing area is available for reading. In step 802, it is 
checked whether the rotating hard diskdrive(s) have un- 
impeded write access. In step 803, provided the second 
solid state data storage device is available for reading, 
and the one or more rotating disk drives are available 

40 for writing, then data is transferred from the first data 
staging area in the second solid state data storage de- 
vice on to the second hard disk partition area, thereby 
emptying the data in the first data staging area on to the 
rotating hard disk drives. 

45 [0079] In the best mode, staging areas are is needed 
because writing to a single solid state data storage de- 
vice would give a single point of failure without there be- 
ing parallel writing to another solid state memory stor- 
age device. It is important that at every stage, there are 

so always two copies of the data. In the best mode, the two 
copies of each segment of a first block of data exist ei- 
ther: 

• „• In a user data area of a solid state data storage de- 
55 vice, and in a data staging area of the other solid 
state data storage device; or 
• In the first user data area of the first solid state data 
storage device and in the first data partition area of 
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the plurality of disks. 

[0080] In parallel with this, a second block of data is 
stored in the first and second solid state data storage 
devices, and this has at least two copies stored at ail 
times including: 

• The second block of data stored in the first data 
staging area of the first solid state device and the 
second user data area of the second solid state data 
storage device; or 

• The second data block stored in the second user 
data area of the second solid state data storage de- 
vice and the second disk partition area of the plu- 
rality of hard disks. 

[0081] Between the two solid state data storage de- 
vices, the first block of data is mirrored, and additionally 
the second block of data is mirrored. The rate at which 
the data staging area can be emptied is limited by the 
rate at which the data can be transferred and written to 
the hard disk. 

[0082] A large amount of data is built up in the user 
data areas of the solid state data storage devices. In an 
embodiment in which the controller writes to the two sol- 
id state devices on a single bus, forf requently used data, 
the access time to a host computer is therefore twice 
the access time of each of the solid state devices, of 
approximately 10-20 us, being a significant improve- 
ment on the millisecond access times of prior art hard 
disk storage arrays. Single point of failure is eliminated 
by the use of a pair of solid state data storage devices. 
[0083] For a best mode embodiment in which the con- 
troller has a separate bus to each solid state device, da- 
ta can be written in parallel to the two solid state devices, 
and the access time is equivalent to the access time of 
one solid state device. 

[0084] For less frequently used data, occasionally the 
host will need to access the hard disk drives, in which 
case access times will be slower. However, on average 
the average access time for data will improve compared 
to the prior art RAID case. The size of the first and sec- 
ond data staging areas 304, 600, are preferably large 
enough to contain all the data in the first and second 
user data areas 300, 302. With current solid state data 
storage technology each non-volatile solid state device 
will typically have a data capacity of the order of 
2Gbytes. 

[0085] Because the solid state data storage devices 
have no moving parts, their reliability can be expected 
to be greater than the reliability of a conventional rotat- 
ing hard disk drive having moving parts. However, even 
though the probability of failure of a solid state device is 
lower than that of a rotating hard disk drive, redundancy 
by provision of two solid state data storage devices is 
still desirable to avoid a single point of failure. 
[0086] Data stored on the plurality of solid state data 
storage devices and in the partition area of the rotating 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



disk drive memories need not be redundancy encoded, 
."since physical redundancy -exists by virtue 'of thtfcopiesi 
of data being present on either two solid state data stor- 
age devices and/or a solid state data storage device and 
a hard disk partition area. By removing the need for re- 
dundancy coding, access times are significantly im- 
proved to the data storage device. Calculation of error 
correction codes incurs a time penalty which is signifi- 
cant compared to the write time to the solid state data 
storage devices. This is not the case for writing to rotat- 
ing disk drive devices, where the error correction code 
times are small compared to the write times to the rotat- 
ing disk drives. The control device 207 becomes aware 
of any failures of a said solid state data storage device 
by means of a conventional command-acknowledge- 
ment protocol as is known in the art. 



Claims 

1- A data storage system comprising: 

at least one rotating disk data storage device; 

a plurality of solid state data storage devices, 
each having a data capacity less than a data 
capacity of said rotating disk data storage de- 
vice; 

wherein each solid state data storage device is 
divided into a data staging area and a user data 
area; and 

said plurality of data storage devices are ar- 
ranged such that a set of data is written to at 
leasttwo of said plurality of solid state datastor- 
age devices; and 

data is copied from a said data staging area of 
a said solid state data storage device into said 
rotating disk data storage device. 

2. The data storage system as claimed in claim 1 , 
comprising a controller device comprising a proces- 
sor and a computer program for controlling said 
processor, said controller device operating to: 

receive data from a host device; 

write said data to said plurality of solid state da- 
ta storage devices, such that a copy of said data 
is stored on each said solid state data storage 
device; and 

transfer data from a said data staging area of 
at least one said solid state data storage device 
to a data partition area of said at least one ro- 
tating disk data storage device. 
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3. The data storage system as claimed in claim 1, 

■i <v -, W herein a said solid state data storage device com- ■ •■• 
prises a magnetic random access memory device. 

4. The data storage system as claimed in claim 1, 5 
wherein said solid state data storage device com- 
prises at least one intrinsically volatile dynamic ran- 
dom access memory, and at least one battery de- 
vice, providing a combined non-volatile solid state 
data storage device. io 

5. The data storage system as claimed in claim 1, 
wherein said solid state data storage device com- 
prises a non-volatile flash electrically programma- 
ble read only memory. 1 $ 

6. The data storage system as claimed in claim 1, 
wherein said solid state data storage device com- 
prises a ferro electric random access memory de- 
vice. 20 

7. The data storage system as claimed in claim 1, 
wherein said solid state data storage devices have 
a memory capacity of greater than 1 Gbyte. 

25 

8. The data storage system as claimed in claim 1, 
wherein said solid state data storage devices are 
non-volatile devices. 

9. The data storage system as claimed in claim 1, 30 
wherein a said data staging area comprises less 
than 50% of the data storage capacity of a corre- 
sponding said solid state data storage device. 

10. The data storage system as claimed in claim 1, 35 
. comprising a controller device comprising a proces- 
sor and a computer program for controlling said 
processor, and a plurality of data buses connecting 
said controller with said plurality of solid state data 
storage devices, wherein each said solid state data 40 
storage device communicates with said controller 
device by a corresponding respective separate said 
bus. 

11. A method of storing data in a data storage system *s 
comprising a plurality of rotating disk drive data stor- 
age devices and a plurality of solid state data stor- 
age devices, said method comprising the steps of: 

writing a first block of data to a first user data so 
area in a first said solid state data storage de- 
vice; 

writing said first block of data to a second data 
staging area in a second said solid state data ss 
storage device; 

writing said data stored in said second data 



staging area of said second solid state data 
storage device to a first -data partition area in 
said plurality of rotating disk drives. 

12. The method as claimed in claim 1 1 , further compris- 
ing the steps of: 

writing a second block of data to a first data 
staging area of said first solid state data storage 
device; 

writing said same second block of data to a sec- 
ond user data storage area of said second solid 
state data storage device; and 

writing said second block of data from said first 
data staging area of said first solid state data 
storage device to a second partition memory ar- 
ea of said plurality of rotating disk drives. 

13. The method as claimed in claim 11,* wherein said 
step of writing data to said second data staging area 
and said first user data area comprises: 

(i) receiving said first block of data in a plurality 
of data segments, each data segment being 
smaller than said first block of data; 

(ii) writing a said segment to said second data 
staging area and writing said data segment to 
said first user data area substantially simulta- 
neously; 

(iii) after writing said segment to said second 
data staging area and said first user data area, 
selecting a new segment; 

(iv) repeating steps (ii) to (iii) until all said seg- 
ments of said data first block are written; and 

(v) generating a confirmation signal for confirm- 
ing that said first data block has been written. 

14. The method as claimed in claim 11 , comprising the 
steps of: 

determining whether new data has been written 
into said second data staging area; and 

if new data has been written to said second data 
staging area, copying data of said second data 
staging area to said first data partition area of 
said plurality of rotating disks. 

15. The method as claimed in claim 11 , comprising the 
steps of: 

determining whether new data has been written 
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into said second data staging area; 

determining whether said second data staging 
area is available for reading; 

5 

determining whether unimpeded access to said 
first data partition area of said plurality of rotat- 
ing disks is available; and 

copying data of said second data staging area 10 
to said first data partition area of said plurality 
of rotating disks. 



16. The method as claimed in claim 11, wherein said 
step of writing data to said first data staging area 15 
and said second user data area comprises: 

(vi) receiving said second block of data in a plu- 
rality of data segments, each data segment be- 
ing smaller than said second block of data; 20 

(vii) writing a said segment to said first data 
staging area and writing said segment to said 
second user data area substantially simultane- 
ously; 25 



19. The method as claimed in claim 11 , comprising the 
"•■>•* steps of: ' ^^ri;^ : v?V'>>^,;''. -v. - ~.,s> 

determining whether new data has been written 
into a first data staging area; 

determining whether said first data staging area 
is available for reading; 

determining whether unimpeded access to a 
second data partition area of said plurality of 
rotating disks is available; and 

copying data of said first data staging area to 
said second data partition area of said plurality 
of rotating disks. 



(viii) after writing said segment to said first data 
staging area and said second user data area, 
selecting a new segment; 

(ix) repeating steps (vii) to (viii) until all said seg- 
ments of said second data block are written; 
and 



30 



(x) generating a confirmation signal for confirm- 
ing that said second data block has been writ- 
ten. 



35 



17. The method as claimed in claim 11, comprising the 
steps of: 

determining whether new data has been written 
into a first data staging area; and 
if new data has been written to said first data 
staging area, copying said data of said first data 
staging area to a second data partition area of 
said plurality of disks. 

18. The method as claimed in claim 11 , comprising the 
steps of: 
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determining whether new data has been written 
into a first data staging area; and 

if new data has been written to said first data 
staging area, copying data of said first data 
staging area to said second data partition area 
of said plurality of rotating disks. 
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Write block of data to first user data area 
of first solid state memory device 



Write same block of data to second staging 
data area of second solid state memory 
device 
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Receive second data block from host 




700 



Write second block of data to first user data 
area of second solid state memory device 



Write same second block of data to first 
staging data area of first solid state 
memory device 



Send confirmation signal to host device 
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